Bipolar Person Name Identification of Topic Documents Using Principal Component Analysis

نویسندگان

  • Chien Chin Chen
  • Chen-Yuan Wu
چکیده

In this paper, we propose an unsupervised approach for identifying bipolar person names in a set of topic documents. We employ principal component analysis (PCA) to discover bipolar word usage patterns of person names in the documents and show that the signs of the entries in the principal eigenvector of PCA partition the person names into bipolar groups spontaneously. Empirical evaluations demonstrate the efficacy of the proposed approach in identifying bipolar person names of topics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Person Name Identification in Chinese Documents Using Finite State Automata

This research is about automatic identification and extraction of person names in Chinese text documents. Solutions to this problem have immediate and extensive applications in many areas especially in Web Intelligent Agents related applications such as Web search engines, Web data mining, and automatic Web information analysis. We have noted that while finite state automata (FSA) based techniq...

متن کامل

Prediction of mineral deposit model and identification of mineralization trend in depth using frequency domain of surface geochemical data in Dalli Cu-Au porphyry deposit

In this research work, the frequency domain (FD) of surface geochemical data was analyzed to decompose the complex geochemical patterns related to different depths of the mineral deposit. In order to predict the variation in mineralization in the depth and identify the deep geochemical anomalies and blind mineralization using the surface geochemical data for the Dalli Cu-Au porphyry deposit, a ...

متن کامل

Identification of mineralization features and deep geochemical anomalies using a new FT-PCA approach

The analysis of geochemical data in frequency domain, as indicated in this research study, can provide new exploratory informationthat may not be exposed in spatial domain. To identify deep geochemical anomalies, sulfide zone and geochemical noises in Dalli Cu–Au porphyry deposit, a new approach based on coupling Fourier transform (FT) and principal component analysis (PCA) has beenused. The re...

متن کامل

Topic-Specific Scoring of Documents for Relevant Retrieval

There has been mixed success in applying semantic component analysis (LSA, PLSA, discrete PCA, etc.) to information retrieval. Here we combine topic-specific link analysis with discrete PCA (a semantic component method) to develop a topic relevancy score for information retrieval that is used in post-filtering documents retrieved via regular Tf.Idf methods. When combined with a novel and intuit...

متن کامل

Developing and Validation of Moral Behavior Styles Inventory

Article history: Received date: 13 September, 2016 Review date: 2 October 2016 Accepted date:20 November 2016 Printed on line: 5 January Purpose: The present study was done to introduce an efficient tool in the field of moral behavior. Material & Method: method of the study was correlational, its approach was test developing and its population was students of Islamic Azad University- Ast...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010